Yushu Technology has been authorized a new patent, which enhances robot decision-making capabilities through a diffusion strategy, solving the problem of action understanding deviation. The core technology includes scene understanding, interaction prediction, and diffusion decision-making, aiming to enhance the robot's accurate perception of future states.
UBTECH's Youqi partners with Volcano Engine to integrate robotics with cloud AI, accelerating large model applications in industrial logistics through multimodal models, VLA, world models, and Doubao ecosystem.....
Xpeng Motors' CEO He Xiaopeng notes the AI industry is still nascent, with significant advances expected in physical AI, autonomous driving reaching near or full L4, and humanoid robots in the next three years.....
The Tnkr platform aims to solve the fragmentation issue in robot development by integrating hardware, software, data, and AI models into a unified open-source ecosystem. This allows developers to collaborate on building physical robot projects, changing the traditional 'puzzle game' mode of switching between different tools.
Create personalized Christmas cards instantly with AI. The first card is free. You can choose to have it printed and delivered or send it as an e - card.
AI-powered online tool for customizing a unique Android robot using a selfie or text prompt
Figure 03 is a general-purpose humanoid robot that uses Helix AI to adapt to home environments.
An embodied AI one-stop development platform released by Zhiyuan Robotics, covering the entire chain from data acquisition to model inference
Google
$0.7
Input tokens/M
$2.8
Output tokens/M
1k
Context Length
Anthropic
$7
$35
200
$2.1
$17.5
$21
$105
Alibaba
$3.9
$15.2
64
-
Bytedance
$0.8
$2
128
Tencent
$1
$4
32
Deepseek
$12
Openai
$1.75
$14
400
$525
Chatglm
Iflytek
$8
nvidia
Cosmos-Predict2.5 is a high-performance pre-trained world foundation model suite developed by NVIDIA specifically for physical AI. Based on diffusion model technology, it can generate high-quality images and videos with physical awareness based on text, image, or video input, providing world simulation capabilities for applications such as autonomous driving and robotics.
unsloth
Cosmos-Reason1 is a physical artificial intelligence model developed by NVIDIA. It has the ability to understand physical common sense and can generate embodied decisions through long-chain thinking reasoning. This model supports multimodal input (text + video/image) and outputs text, which is suitable for physical AI fields such as robotics and autonomous driving.
A Minecraft model control protocol server based on Mineflayer, providing a standardized JSON - RPC interface for AI agents to control Minecraft robots.
AI agent robot project based on multi-MCP servers
The ros2-mcp-server is a Python-based server that integrates with ROS 2 through the Model Context Protocol (MCP), enabling AI assistants to control robot movement through ROS 2 topics. It supports time-controlled movement commands and runs as a ROS 2 node, publishing geometry_msgs/Twist messages to the /cmd_vel topic.
RegenNexus UAP is a universal adapter protocol for connecting devices, robots, applications, and AI agents, providing low-latency, high-security communication, and supporting multiple hardware and MCP integrations.
A virtual travel robot environment based on the MCP server, allowing users to conduct virtual travel through Google Maps and interact with AI travelers.
The Minecraft MCP integration project enables AI assistants to interact with the Minecraft server and observe and operate in the game world through robots.
The glif - mcp - server is an MCP server for running AI workflows, supporting the management and running of glifs and robot tools from glif.app, and providing rich metadata access functions.
This project uses Spring AI and the MCP protocol to achieve natural language control of the mBot2 robot. It includes a Spring Boot application, an MQTT message queue, and a Python script on the robot side, supporting AI calls for instructions such as exploration and turning.
NiagaBot is a smart WhatsApp business automation robot based on Qwen3-Omni AI, supporting functions such as multimodal message processing, group management, batch broadcasting, and data analysis
This is an MCP service for Airbnb listing search and detail query, providing structured data and direct links, working without an API key and complying with robots.txt rules.
An intelligent conversational robot project based on large models, supporting multi - platform access and multiple AI models, with text, voice, image processing, and plugin expansion capabilities, and can customize enterprise AI applications.
Airbnb search and listing information desktop extension, providing advanced search filtering functions and detailed listing information retrieval. It supports various query conditions such as location search, date filtering, and price range, and complies with the robots.txt protocol to ensure compliant use.
Lark MCP is an AI assistant framework based on Feishu. It realizes function calls and message processing through reverse Feishu protocol, supports custom function registration and automatic matching calls. There's no need to configure a robot, and you can directly use your personal Feishu account as an AI assistant.
MinecraftBuildMCP is an MCP server that connects AI language models with Minecraft. It enables automated building, exploration, and interaction in Minecraft by controlling robots through natural language.